CDS

Accession Number TCMCG019C38070
gbkey CDS
Protein Id XP_022926858.1
Location join(1661224..1661305,1661636..1661707,1661855..1661905,1662036..1662125,1662571..1662688,1662767..1662844,1663039..1663072,1663156..1663224,1664062..1664133)
Gene LOC111433844
GeneID 111433844
Organism Cucurbita moschata

Protein

Length 221aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023071090.1
Definition DNA-directed RNA polymerases IV and V subunit 4-like isoform X1 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category K
Description DNA-directed RNA polymerases IV and V subunit
KEGG_TC -
KEGG_Module M00180        [VIEW IN KEGG]
KEGG_Reaction R00435        [VIEW IN KEGG]
R00441        [VIEW IN KEGG]
R00442        [VIEW IN KEGG]
R00443        [VIEW IN KEGG]
KEGG_rclass RC02795        [VIEW IN KEGG]
BRITE br01611        [VIEW IN KEGG]
ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K03012        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00230        [VIEW IN KEGG]
ko00240        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko03020        [VIEW IN KEGG]
ko05016        [VIEW IN KEGG]
ko05169        [VIEW IN KEGG]
map00230        [VIEW IN KEGG]
map00240        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map03020        [VIEW IN KEGG]
map05016        [VIEW IN KEGG]
map05169        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCGGAGAAAGGCGAAAAGGGTTTTCCAGTGCAGAAGAAACCTGGAAAGTCTTCGCTCAAATCTTCGGCTTCCAAGGATGCTTCTCTAAAAGGAAAGGATGACAGTTTGTCGAAGTCAAAGAAGGGTAGGAAAGTCCAGTTCGATGCTCAAGGGTCAGTTGATGCGCAGATCAATCTTTCATTGAAATTCAGTGGAAAAAATGGTGACTTGGGTAAAGGAGGGAAAGGTATAAATGGTGGGAAGGCTTCTGTTTCAAAGGAACAACAACCGCTAGAACTGAAGATTGAGCAAGAACTTCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGTGAGGCTGCACAACTTTTACAGGGAATCCAAGATCAGATGGCTCTTCTATCAGCAGATCCAACCATCAAAATCCCTACGTCATTTGATCGGGGACTGCAATATGCTAAACGAGCCAACCACTATGTAAATACCGAGTTGGTTAGACCAGTTCTTGAAACCCTCAAGAAATATGGTGTAGCGGACAGTGAGATATGTGTGATTGCTAATGTCTGCCCAGACACTACTGATGAAGTTTTTTCTCTTGTTCCATCTTTGAAGAGCAAAAGAAGCAAGCTAACCGAACCTCTGAACAACGTCTTGATTGAGCTAGCCAAGCTAAAATCATCCTGA
Protein:  
MSEKGEKGFPVQKKPGKSSLKSSASKDASLKGKDDSLSKSKKGRKVQFDAQGSVDAQINLSLKFSGKNGDLGKGGKGINGGKASVSKEQQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIQDQMALLSADPTIKIPTSFDRGLQYAKRANHYVNTELVRPVLETLKKYGVADSEICVIANVCPDTTDEVFSLVPSLKSKRSKLTEPLNNVLIELAKLKSS